Skip to content

[AMD] Optimize search space and upgrade Image to 0.19.0 for MiniMax-M2.5#1003

Merged
cquil11 merged 8 commits intomainfrom
chun-taoyu/minimaxm2.5-fp8-single-node
Apr 5, 2026
Merged

[AMD] Optimize search space and upgrade Image to 0.19.0 for MiniMax-M2.5#1003
cquil11 merged 8 commits intomainfrom
chun-taoyu/minimaxm2.5-fp8-single-node

Conversation

@chunfangamd
Copy link
Copy Markdown
Collaborator

@chunfangamd chunfangamd commented Apr 5, 2026

  • Optimize MiniMax-M2.5 FP8 MI355X vLLM search-space
  • Upgrade Image to v0.19.0
  • Enable FP8 KV cache + AITER FA (It is important to recheck this when upgrading to higher image versions)

e2e Test run: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/23987768210

co-author: @benenzhu

Fewer GPUs means less inter-GPU communication overhead, and MoE
expert parallelism across 2 GPUs is very efficient for this model.
Enable FP8 KV cache + AITER FA for minimaxm2.5-fp8-mi355x-vllm
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 5, 2026

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

1 similar comment
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 5, 2026

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

Copy link
Copy Markdown
Contributor

@functionstackx functionstackx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm once validation passes

@benenzhu
Copy link
Copy Markdown
Collaborator

benenzhu commented Apr 5, 2026

vllm recipes pr updated: vllm-project/recipes#300

@cquil11 cquil11 enabled auto-merge (squash) April 5, 2026 16:59
@cquil11 cquil11 disabled auto-merge April 5, 2026 16:59
@cquil11 cquil11 merged commit 0ee5ff3 into main Apr 5, 2026
96 of 97 checks passed
@cquil11 cquil11 deleted the chun-taoyu/minimaxm2.5-fp8-single-node branch April 5, 2026 16:59
@cquil11 cquil11 changed the title Optimize search space and upgrade Image to 0.19.0 for MiniMax-M2.5 [AMD] Optimize search space and upgrade Image to 0.19.0 for MiniMax-M2.5 Apr 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

Development

Successfully merging this pull request may close these issues.

4 participants